Non-linear Speech Transition Visualization

نویسندگان

Klaus Reinhard

Mahesan Niranjan

چکیده

Modelling context eeects and segmental transitions in speech recognition systems is very important. Explicitly modelling segmental transitions in a RNN framework would circumvent these problems. We present an interesting application of Principal Curves, an algorithm to extract a non-linear summary of p-dimensional data rstly published in 1989 by Hastie/Stuetzle. The algorithm can be used to visualize non-linear transient characteristics in speech. We will show that between-phone characteristics found within diphones can be used as discriminant information to distinguish ambiguous phones. The technique used is explained and illustrated on the examples /bah/, /dah/ and /gah/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech recognition under musical environments using kalman filter and iterative MLLR adaptation

In this paper, we propose a speech recognition method under non-stationary musical environments using Kalman ltering speech signal estimation method and iterative unsupervised MLLR(Maximum Likelihood Linear Regression) adaptation. Our proposing method estimates the speech signal under non-stationary noisy environments such a s m usical background by applying speech state transition model to Kal...

متن کامل

Synthesizing multimodal utterances for conversational agents

Conversational agents are supposed to combine speech with non-verbal modalities for intelligible multimodal utterances. In this paper, we focus on the generation of gesture and speech fromXML-based descriptions of their overt form. An incremental production model is presented that combines the synthesis of synchronized gestural, verbal, and facial behaviors with mechanisms for linking them in f...

متن کامل

Gene Time Echipression Warper: a tool for alignment, template matching and visualization of gene expression time series

UNLABELLED An application tool for alignment, template matching and visualization of gene expression time series is presented. The core algorithm is based on dynamic time warping techniques used in the speech recognition field. These techniques allow for non-linear (elastic) alignment of temporal sequences of feature vectors and consequently enable detection of similar shapes with different pha...

متن کامل

Utilizing Kernel Adaptive Filters for Speech Enhancement within the ALE Framework

Performance of the linear models, widely used within the framework of adaptive line enhancement (ALE), deteriorates dramatically in the presence of non-Gaussian noises. On the other hand, adaptive implementation of nonlinear models, e.g. the Volterra filters, suffers from the severe problems of large number of parameters and slow convergence. Nonetheless, kernel methods are emerging solutions t...

متن کامل

Enhanced harmonic coding of speech with frequency domain transition modelling

A major source of audible distortion in current low-bit-rate harmonic speech coding algorithms is the ineffective modeling of the transitional speech signals such as onsets, plosives etc.. A new method of modeling transitional speech based on a frequency domain approach is introduced in this paper. The approach uses a modified harmonic model able to produce non-periodic pulse sequences in conju...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

Non-linear Speech Transition Visualization

نویسندگان

چکیده

منابع مشابه

Speech recognition under musical environments using kalman filter and iterative MLLR adaptation

Synthesizing multimodal utterances for conversational agents

Gene Time Echipression Warper: a tool for alignment, template matching and visualization of gene expression time series

Utilizing Kernel Adaptive Filters for Speech Enhancement within the ALE Framework

Enhanced harmonic coding of speech with frequency domain transition modelling

عنوان ژورنال:

اشتراک گذاری